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Dear Sir: 

Prior to calculating the filing fee for the above-referenced 
patent application, please enter the following amendments: 
In the Claims: 

In claim 5, line 2, please delete "any one of the preceding 
claims" and insert therefor — claim 1--. 

In claim 7, line 2, please delete "or 6". 

In claim 8, line 2, please delete "any one of claims 1 to. 4" 
and insert therefor --claim 1 — . 

In claim 11, line 2, please delete "or 9". 

In claim 12, line 4, please delete "any one of claims 1 to 
4" and insert therefor --claim 1 — . 

In claim 15, line 3, please delete "or 13". 
Please add the following new claims: 



16. An antibody or fragment thereof which specifically 
binds to a protein according to claim 9. 

17. A method of detecting in a sample the presence of an 
effector protein for the Grb7 family of proteins, the method 
comprising reacting the sample with an antibody or fragment 
thereof according to claim 16. 

18. A method of detecting in a sample the presence of mRNA 
encoding an effector protein for the Grb7 family of proteins, the 
method comprising reacting the sample with an oligonucleotide 
probe of claim 13. 

REMARKS 

The amendments set forth above are made to simplify the 
claim dependencies. No new matter is introduced into the 
application by means of these amendments. 
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Washington, D.C. 20004 
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A EQJ'FNTTAT. FFFECTOR FDR THE HRR TJAMILX-QF SI^NALLING_ 
PROTEINS 

Field of the Invention: 

5 

The present invention relates to a novel polynucleotide molecule 
encoding a candidate effector protein for the Grb7 family of signalling 
proteins. Detection of 1he encoded protein in a tissue sample should provide 
a useful tumour marker and/or prognostic indicator. Furthermore, 
10 antagonism of the interaction between Grb7 family members and the 
encoded protein should provide a novel treatment strategy for human 
diseases exhibiting aberrant receptor tyrosine kinase (RTK) signalling (e.g. 
cancer). 

15 Background of the Invention 

RTKs play a major role in the regulation of cellular growth, 
differentiation, motility and metabolism by converting an extracellular signal 
in the form of the binding of a specific hormone or growth factor to the 

20 activation of specific signalling pathways and hence modes of intracellular 
communication (Schlessinger and Ullrich, Neuron 9. 383-391. 1992). 
Activation of RTKs results in both autophosphorylation of the receptor and 
the phosphorylation of downstream targets on tyrosine residues. It has 
become evident over the last decade that key elements in receptor-substrate 

25 and other protein-protein interactions in RTK signalling are src homology 
(SH)2 domains. SH2 domains are conserved modules of approximately 100 
amino acids found in a wide variety of signalling molecules wmich bind to 
short tyrosine-phosphorylated peptide sequences. The specificity of 
interaction is determined both by the nature of the amino acids flanking the 

30 phosphotyrosine residue in the target peptide and residues in the SH2 

domain which interact with these sites (Pawson, Nature 373. 573-580. 1995). 

SH2-domain containing proteins can be divided into two classes: those 
which possess a catalytic function (e.g. the cytoplasmic tyrosine kinase c-src 
and the t .osine phosphatase SH-PTP2) and those which consist entirely of 

35 non-catalytic protein domains (eg Grb2). the adaptor sub-class. The function 
of the latter class is to link separate catalytic subunits to a tyrosine- 
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phosphorylated receptor or signalling intermediate, and other non-catalytic 
protein modules are often involved in these interactions. For example. SH3 
and WW domains (conserved regions of approximately 50 and 40 amino 
acids, respectively) bind proline-rich peptide ligands. and pleckstrin 
homology domains (approximately 100 amino acids) interact with both 
specific phospholipid and protein targets (Pawson, 1995 supra). 

The Grb7 family represents a family of SH2 domain-containing 
adaptors which currently contains three members: Grb7. 10 and 14 (Margolis 
et al Proc. Natl Acad. Sci. USA 89, 8894-8898. 1992: Stein et al EMBO J 13. 
1331-1340. 1994: Ooi et al Oncogene 10. 1621-1630. 1995: Daly et al J. Biol 
Chem. 271. 12502-12510, 1996). These proteins share a common overall 
architecture, consisting of an N-terminal region containing a highly 
conserved proline-rich decapeptide motif, a central region harbouring a PH 
domain and a C- terminal SH2 domain. The central region of approximately 
300 amino acids bears significant homology to the C. elegans protein miglO, 
which is required for long range neuronal migration in embryos, otherwise 
the Grb7 family and miglO are structurally distinct. However, they exhibit 
differences in both SH2 selectivity towards RTKs (Janes et al. J. Biol. Chem. 
272, 8490-8497, 1997) and tissue distribution. The family has therefore 
evolved to link particular receptors to downstream effectors in a tissue- 
specific manner. Interestingly, the genes encoding this family appear to have 
co-segregated with ERBB family genes during evolution. Thus GRB7. 10 and 
14 are linked to ERBB 2. ERBBl (epidermal growth factor receptor) and ERBB4, 
respectively (Stein et al 1994 supra; Ooi et al, 1995 supra: Baker et al. 
Genomics 36, 218-220. 1996). The juxtaposition of GRB7 and ERBB2 leads to 
common co-amplification in human breast cancers, and since the two gene 
products are functionally linked, likely up-regulation of an undefined erbB2 
signalling pathway. Furthermore, GRB14 also exhibits differential expression 
in human breast cancers (Daly et al, 1996 supra). These two proteins may 
therefore modulate RTK signalling in this disease. 

In order to identify proteins which bind to this family and therefore 
identify candidate effectors, we performed a genetic screen using the yeast 
two hybrid system and Grbl4 "bait". This application describes the cloning 
and characterization of a novel interacting protein, currently designated 
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Disclosure of the Invention: 

Thus, in a first aspect, the present invention provides an isolated 
polynucleotide molecule encoding a candidate effector protein for the Grb7 
5 family of signalling proteins, wherein the polynucleotide molecule comprises 
a nucleotide sequence having at least 75% sequence identity to that shown as 
SEQ ID NO: 1. 

Preferably, the polynucleotide molecule comprises a nucleotide 
sequence having at least 85%, more preferably at least 95%. sequence 
10 identity to that shown as SEQ ID NO: 1. Most preferably, the polynucleotide 
molecule comprises a nucleotide sequence encoding a polypeptide 
comprising an amino acid sequence substantially corresponding to that 
shown as SEQ ID NO: 2. 

In a preferred embodiment of the invention of the first aspect, the 
15 polynucleotide molecule comprises a nucleotide sequence which 
substantially corresponds to that shown as SEQ ID NO: 1. 

The polynucleotide molecule may be a dominant negative mutant 
which encodes a gene product causing an altered phenotype by. for example, 
reducing or eliminating the activity of endogenous effector proteins of the 
20 Grb7 family of signalling proteins. 

The polynucleotide molecule may be incorporated into plasmids or 
expression vectors (including viral vectors), which may then be introduced 
into suitable host cells such as bacterial, yeast, insect and mammalian host 
cells. Such host cells may be used to express the protein encoded by the 
25 polynucleotide molecule. 

Accordingly, in a second aspect, the present invention provides a host 
cell transformed with the polynucleotide molecule of the first aspect. 

In a third aspect, the present invention provides a method of producing 
a protein, comprising culturing the host cell of the second aspect under 
30 conditions suitable for the expression of the polynucleotide molecule and 
optionally recovering the protein. 

Preferably, the host cell is mammalian or of insect origin. Where the 
cell is mammalian, it is presently preferred that it be a Chinese hamster 
ovary (CHO) cell or human embryonic kidney (HEK) 293 cell. Where the 
35 host cell is of insect origin, it is presently preferred thai it be an insect Sf9 
cell. 
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In a fourth aspect, the present invention provides a purified protein 
encoded by the polynucleotide molecule of the first aspect. 

In a preferred embodiment of this aspect, the purified protein 
comprises an amino acid sequence substantially corresponding to that shown 
5 asSEQIDNO:2. 

In a fifth aspect, the present invention provides a fusion protein 
comprising an amino acid sequence substantially corresponding to that 
shown as SEQ ID NO: 2. 

Fusion proteins according to the fifth aspect may include an N- 
10 terminal fragment of a protein such as (3-galactosidase to assist in the 

expression and selection of host cells expressing candidate effector protein, 
or may include a functional fragment of any other suitable protein to confer 
additional activity(ies). 

In a sixth aspect, the present invention provides an antibody oi- 
ls fragment thereof which specifically binds to the protein of the fourth aspect. 
The antibody may be monoclonal or polyclonal, however, it is 
presently preferred that the antibody is a monoclonal antibody. Suitable 
antibody fragments include Fab, F(ab') 2 and scFv. 

In a seventh aspect, the present invention provides an oligonucleotide 
20 probe comprising a nucleotide sequence of at least 12 nucleotides, the 
oligonucleotide probe comprising a nucleotide sequence such that the 
oligonucleotide probe selectively hybridises to the polynucleotide molecule 
of the first aspect under high stringency conditions (Sambrook et al,. 
Molecular Cloning: a Laboratory Manual. Second Edition. Cold Spring Harbor 
25 Laboratory Press). 

In a preferred embodiment of this aspect, the oligonucleotide probe is 
labelled. In a further preferred embodiment of this aspect, the 
oligonucleotide probe comprises a nucleotide sequence of at least 18 
nucleotides. 

30 In a « eighth aspect, the present invention provides a method of 

detecting in a sample the presence of an effector protein for the Grb7 family 
of proteins, the method comprising reacting the sample with an antibody or 
fragment thereof the sixth aspect, and detecting the binding of the antibody 
or fragment thereof. 



WO 99/15647 



PCT/AU98/00795 



The method of the eighth aspect may be conducted using any 
immunoassays well known in the art (e.g. ELISA). The sample may be. for 
example, a cell lysate or homogenate prepared from a tissue biopsy. 

In a ninth aspect, the present invention provides a method of detecting 
5 in a sample the presence of mRNA encoding an effector protein for the Grb7 
family of proteins, the method comprising reacting the sample with an 
oligonucleotide probe of the seventh aspect, and detecting the binding of the 
probe. 

The method of the ninth aspect may be conducted using any 
10 hybridisation assays well known in the art (e.g. Northern blot). The sample 
may be a poly(A) RNA preparation or homogenate prepared from a tissue 
biopsy. 

Grb7 family proteins exhibit differential expression in certain human 
cancers (particularly breast and prostate cancer) and may therefore be 
15 involved in tumour progression. Detection of the protein encoded by the 
cDNA 2.2412 in a sample should provide a useful tumour marker and/or 
prognostic indicator for these cancers. Furthermore, the interaction of Grb7 
family members with 2.2412 may provide a novel target for therapeutic 
intervention. 

20 It is to be understood that methods of detecting suitable agonists and 

methods of therapy utilising detected agonists also form part of the present 
invention. 

The term "substantially corresponds" as used herein in relation to the 
nucleotide sequence shown as SEQ ID NO: 1 is intended to encompass minor 

25 variations in the nucleotide sequence which due to degeneracy in the DNA 
code do not result in a change in the encoded protein. Further, this term is 
intended to encompass other minor variations in the sequence which may be 
required to enhance expression in a particular system but in which the 
variations do not result in a decrease in biological activity of the encoded 

30 protein. 

The term "substantially corresponding" as used herein in relation to the 
amino acid sequences shown as SEQ ID NO: 2 is intended to encompass 
minor variations in the amino acid sequences which do not result in a 
decrease in biological activity of the protein. These variations may include 
35 conservative amino acid substitutions. The substitutions envisaged are:- 

G, A, V. I, L. M: D. E; N, Q; S, T; K. R. H: F. Y. W, H: and 
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P. Na-alkalamino acids. 

The terms "comprise", "comprises" and "comprising" as used 
throughout the specification are intended to refer to the inclusion of a stated 
step, component or feature of group of steps, components of features with or 
5 without the inclusion of a further step, component or feature or group of 
steps, components or features. 

The invention will hereinafter be described with reference to the 
accompanying figure and the following, non-limiting example. 

10 Brief description of the accompanying figure: 

Figure 1 provides the nucleotide and amino acid (single letter code) 
sequence of 2.2412. Numbers refer to distances in base pairs. Ankyrin-type repeat 
sequences are underlined. An additional repeat sequence is indicated by italics. 
15 The stop codon is represented by an asterisk. The original cDNA clone 2.2412 
isolated by the two hybrid screen spans nucleotides 694-2664 of this sequence. 

Figure 2 provides a map of the 2.2412-binding region on Grbl4. 

A. Structure of the deletion constructs used in the analysis. Gal4 DNA-BD fusion 
r 20 constructs encoding full length Grbl4 (FL). the N-terminal (N). central region (C) 

and N-terminal 4- central region (N + C) were generated in the vector pAS2.1. 

B. Results of 0-galactosidase activity assays following transformation of the above 
plasmids into yeast strain Y190 together with the original 2.2412 cDNA clone in 
pACT-2. 

25 

Example: CLONING AND CHARACTERISATION OF 2.2412 

Yeast two hybrid screen 

The yeast two hybrid system exploits protein-protein interactions to 

30 reconstitute a functional transcriptional activator which can then be detected 
using a gene reporter system (Fields and Sternglanz. TIG. 10. 286-292. 1994). 
The technique takes advantage of the properties of the Gal4 protein of the 
yeast S. cerevisiae. The Gal4 DNA binding domain (DNA-BD) or activation 
domain (AD) alone are incapable of inducing transcription. However, an 

35 interaction between two proteins synthesized as DNA-BD- and AD-fusions. 
respectively, brings the Gal4 domains into close proximity and results in 
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transcriptional activation of two reporter genes [H1S3 and LacZ) which can be 
monitored by growth on selective medium and biochemical assays. 

A plasmid construct encoding a Gal4 DNA-BD-Grbl4 fusion was 
generated as follows. The plasmid GRBl 4/pRcCMVF containing full length 
5 GRBl 4 cDNA (Daly et al. 1996) was restricted with Hindlll and Klenow 
treated to create blunt ends, and then digested with Bell to release three 
fragments of approximately 1.1, 4.2 and 1.7 kb. The 1.7 kb fragment was 
isolated and cloned into the Ndel (Klenow treated) and BamHl sites of the 
yeast expression vector pAS2.1 (Clontech) to generate GRBl4/pAS2.1 
10 containing an in-frame fusion of full length Grbl4 with the GAL4 DNA-BD. 
This construct was introduced by electroporation into the yeast strain 
CG1945 [MATa, ura3-52, hisS-200, ade2-101, lys2-801,trpl-90I, leu2-3. 112, 
gal4-542, gal80-538, cyh r 2. LYS2::GALl UAS-GALt TATA-HIS3, 
URA3::GAL4i7 iners (x3)-CYClTATA-lcicZ) selecting for. tryptophan 
15 prototrophy. The expression of the fusion protein was verified by Western 
blot analysis with antibodies directed against the Flag epitope and the Gal4 
DNA-BD. The recipient strain was then grown to mid-log phase and a human 
liver cDNA library in the vector pACT2 (Clontech) introduced using the LiAc 
procedure (Schiestl and Gietz, Curr. Genet. 16. 339-346. 1989). Transformants 
20 were then selected for tryptophan, leucine and histidine prototrophy in the 
presence of 5mM 3-aminotriazole. 

From a screen of lxlO 6 clones, 39 colonies were initially selected on 
synthetic complete (SC)-leu-his-trp +3 AT medium and were then tested for 
P-galactosidase activity. 12 clones scored positive in the latter assay and were 
25 subjected to cycloheximide (CHX) curing to remove the bait plasmid by 

streaking out on SC-leu media containing lOug/ml CHX (pAS2-l contains the 
CYH2 gene which restores CHX sensitivity to CG1945 cells). This enabled 
confirmation of the bait dependency of LacZ activation and subsequent 
isolation of the pACT2 plasmids encoding interacting proteins by standard 
30 methodology (Philippsen et al. Methods in Enzymology 194, 170-177). Back 
transformations were then performed in which these pACT2 plasmids were 
introduced into CG1945 strains containing the bait plasmid {GRBl 4/pAS2-l) 
or constructs encoding non-related Gal4 DNA-BD fusions in order to confirm 
the specificity of the interactions. 
35 The DNA sequences of the cDNA inserts were then obtained by cycle 

sequencing (f-mol kit. Promega) using pACT2-specific and/or clone-specific 
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primers. Based on their nucleotide sequences the 12 interacting clones were 
classified inlo 6 independent groups (see Table I). 
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Six clones were partial cDNAs corresponding to Nedd4. a multidomain 
protein containing a calcium-dependent phospholipid binding (CaLB) 
domain, four WW domains and a C-terminal region homologous to the EG-AP 

30 carboxyl-terminus (Kumar et al Biochem. Biophys. Res. Commun. 185. 1155- 
1161, 1992: Sudol eta/ /. Biol. Chem. 270, 14733-14741. 1995; Huibregtse et al 
Proc. Natl Acad. Sci. USA 92. 2563-2567, 1995). The latter is likely to confer 
E3 ubiquitin-protein ligase activity on Nedd4. The pACT2 clones isolated 
encoded the CaLB domain together with the first 22 amino p~ds of the first 

35 WW domain. 
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Two clones encoded the intracellular region and part of the 
extracellular domain of Htk. which is a RTK of the Eph family (Bennett et al 
J. Biol. Chem. 269. 14211-14218. 1994). The recruitment of Grbl4 by Htk is of 
interest for two reasons. First, the expression profile of both Htk and the 
5 murine homologue myk-1 are indicative of a potential role in mammary 

gland development and neoplasia (Andres et al Oncogene 9. 1461-1467. 1994: 
Berclaz et al Bio chem. Biophys. Res. Comm. 226. 869-875. 1996). Second. Eph 
family members may be involved in the regulation of cell migration (Tessier- 
Lavigne. Cell 82. 345-348. 1995). which is intriguing given the homology of 

:i0 the Grb7 family to the C. elegans protein miglO (Stein et al. 1994 supra). 

A novel cDNA of 1971 bp. designated 2.2412. was also isolated. This clone 

: encoded a polypeptide of 657 amino acids in frame with the Gal4 DNA-BD. The 
cDNA did not contain a stop codon. and this, together with the Northern analysis 
described below, indicated that it was incomplete. This DNA fragment was 

-15 therefore used as a probe to screen a human placental cDNA library (5' STRETCH 
PLUS. Clontech. in XgUO). This resulted in the isolation of two clones, designated 
clone 8 and clone 12. Clone 8 was approximately 2 kb and overlapped the original 
2.2412 clone by 900 bp at the 3' end. This clone provided the carboxy-terminal 
end of the 2.2412 protein sequence (Figure 1). Clone 12 was approximately 3.5 kb 

20 and to date has provided an additional 692 bp of sequence information in the 5' 
direction. The nucleotide and protein sequence for 2.2412 provided by these 
overlapping clones is shown in Figure 1. Since a 5' initiation codon has yet to be 
identified the coding sequence still appears to be incomplete. 

25 Further characterization of 2.2412 

Database searches using the 2.2412 cDNA sequence revealed 
significant homology with a large number of proteins containing ankyrin-like 
repeats. These sequences were first identified as homologous regions 
between certain cell cycle regulatory proteins and the Drosophila protein 

30 Notch (Breeden and Nasmyth. Nature 329. 651-654. 1987) but subsequently 
they have been identified in a wide variety of other proteins where they are 
thought to function in protein-protein interactions (Bork. Proteins 17. 363- 
374. 1993). Subsequent analysis of the protein sequence identified 18 
consecutive ankyrin repeats and an additional repetitive element (Figure 1). 

35 The ankyrin repeat region is followed by a stretch of approximately 40 amino 
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acids rich in serine residues. The remaining C-terminal region has a 
relatively high content of charged amino acids. 

Northern analysis of 2.2412 mRNA expression 
5 Northern blot analysis of multiple tissue northerns (Clontech) was 

performed using the original 2.2412 cDNA as a probe. This resulted in the 
detection of a single mRNA transcript of approximately 7 kb in all tissues 
examined with the exception of the kidney. Expression was particularly high 
in skeletal muscle and placenta. The size of this transcript compared to that 
10 of the 2.2412 clone indicates that the latter represents only a partial cDNA. 

Genomic localization of the 2.2412 gene 

Fluorescence in situ hybridization of the original 2.2412 cDNA to 
normal metaphases (Baker el al. 1996 supra) and reference to the FRA10A 

15 fragile site at 10q23.32 localized the gene to between chromosome 10q23.2 
and proximal 10q23.32. Interestingly, deletions in the 10q22-25 region of 
chromosome 10 have been detected in a variety of human cancers including 
breast, prostate, renal, small cell lung and endometrial carcinomas, 
glioblastoma multiforme, melanoma and meningiomas, suggesting the 

20 presence of one or more tumour suppressive loci in this region (Li et al, 

Science 275, 1943-1947. 1997: Steck et al, Nature Genetics 15, 356-362. 1997, 
and references therein). Two candidate tumour suppressor genes have been 
identified in this region (MMAC 1/PTEN and MXIl. Li et al 1997 supra: Steck 
et al 1997 supra; Albarosa et al, Hum. Genet. 95. 709-711, 1995). 

25 

Analysis of the interaction between 2.2412 and Grb7 family members 

cDNAs encoding the full length and N- and C-terminal regions of the 
original 2.2412 cDNA clone (nucleotides 694-2664, 694-1614 and 1615-2664 
of the sequence shown in Figure 1, respectively) were cloned into the vector 

30 pGEX4T2 (Pharmacia). The full length construct was generated by 

subcloning from the pACT2 clone as a Ndel fragment, whereas the shorter 
constructs were synthesized by directional cloning of PCR products. The 
corresponding GST-fusion proteins were purified from IPTG-induced 
bacterial cultures using glutathione-agarose beads (Smith and Johnson. Gene 

35 67, 31-40, 1988). These immobilized fusion proteins were then incubated 

with lysates from cells expressing Flag epitope-tagged Grbl4 (Daly et al 1996 
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supra) or human breast cancer cells expressing high levels of Grb7 (SK-BR-3: 
Stein et al. 1994) as described previously (Daly et al. 1996). Following 
washing, bound proteins were detected by Western blot analysis. The results 
indicated that 2.2412 bound specifically to both Grbl4 and Grb7z"n vitro, and 
5 that the N-terminal fusion protein bound more strongly than that derived 

from the C-terminus. These data, obtained using a different methodology for 
detecting protein-protein interactions to the yeast two hybrid system, confirm 
that 2.2412 interacts with Grbl4. Furthermore, 2.2412 also binds Grb7. 
Consequently 2.2412 appears to represent a general effector for the Grb7 
10 family. 

Mapping of the 2.2412 binding region on Grbl4 

In order to identify the region of Grbl4 that interacts with 2.2412. a 
series of Grbl4 deletion mutants were generated by cloning PCR fragments 

15 synthesized using the appropriate flanking primers into the vector pAS2.1. 

These fragments spanned the following regions: N-termiims ("N". amino acids 
1-110), the central region ("C") encompassing the miglO homology and the 
"between PH and SH2" (BPS) domain (amino acids 110-437) and the N- 
terminal and central regions ("N + C", amino acids 1-437). These plasmids 

20 were individually transformed into the yeast strain Y190 {MATa, ura3-52, 
his3-200, ade2-101, lys2-801, trpl-901, leu2-3, 112, gal4A, galSOA, cyh r 2, 
LYS2::GAL1UAS'HIS3TATA-HIS3, UEA3::GALlUAS-GALlTATA-IacZ) and 
expression of the appropriately sized Gal4 DNA-BD fusion proteins 
confirmed by Western blotting. Following transformation of the resulting 

25 yeast strains with the original 2.2412 cDNA clone in pACT-2, the strength of 
the interaction was determined by either liquid- or filter-based p- 
galactosidase assays. The results are presented in Figure 2. and demonstrate 
that the N-terminal region of Grbl4 is not only required, but is also 
sufficient, for binding 2.2412. This supports the hypothesis that 2.2412 

30 represents a general effector for the Grb7 family, since the N-terminal region 
of these proteins contains a highly conserved proline-rich motif which may 
mediate this interaction. 
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It will be appreciated by persons skilled in the art that numerous 
variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
invention as broadly described. The present embodiments are. therefore, to 
be considered in all respects as illustrative and not restrictive. 
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Sequence listings : 

SEQUENCE LISTING 

Applicant: Garvan Institute of Medical Research 

Title of Invention: A potential effector for the Grb7 family of signalling 
proteins . 



Current Application Number: 
Current Filing Date: 

Prior Application Number : P09388 

Prior Application Filing Date: 1997-09-23 

Number of ID SEQ Nos : 2 



Software: Patentln Ver. 2.0 

SEQ ID NO: 1 
Length: 3400 
Type : DNA 

Organism: Homo sapiens 



Sequence: 1 

attcctcttc ataatgcatg ctcttttggt catgctgaag tagtcaatct ccttttgcga 60 

catggtgcag accccaatgc tcgagataat tggaattata ctcctctcca tgaagctgca 120 

attaaaggaa agattgatgt ttgcattgtg ctgttacagc atggagctga gccaaccatc 180 

cgaaatacag atggaaggac agcattggat ttagcagatc catctgccaa agcagtgctt 240 

actggtgaat ataagaaaga tgaactctta gaaagtgcca ggagtggcaa tgaagaaaaa 300 

atgatggctc tactcacacc attaaatgtc aactgccacg caagtgatgg cagaaagtca 360 

actccattac atttggcagc aggatataac agagtaaaga ttgtacagct gttactgcaa 420 

catggacgtg atgtccatgc taaagataaa ggtgatctgg taccattaca caatgcctgt 480 

tcttatggtc attatgaagt aactgaactt ttggtcaagc atggtggctg tgtaaatgca 540 

atggacttgt ggcaattcac tcctcttcat gaggcagctt ctaagaacag ggttgaagta 600 

tgttctcttc tcttaagtta tggtgcagac ccaacactgc tcaattgtaa gaataaaagt 660 

gctatagact tggctcccac accacagtta aaagaaagat tagcatatga atttaaaggc 720 

cactcgttgc tgcaagctgc acgagaagct gatgttactc gaatcaaaaa acatctctct 780 

ctggaaatgg tgaatttcaa gcatcctcaa acacatgaaa cagcattgca ttgtgctgct 840 

gcatctccat atcccaaaag aaagcaaata tgtgaactgt tgctaagaaa aggagcaaac 900 

atcaatgaaa agactaaaga attcttgact cctctgcacg tggcatctga gaaagctcat 960 

aatgatgttg ttgaagtagt ggtgaaacat gaagcaaagg ttaatgctct ggataatctt 1020 

ggtcagactt ctctacacag agctgcatat tgtggtcatc tacaaacctg ccgcctactc 1080 

ctgagctatg ggtgtgatcc taacattata tcccttcagg gctttactgc tttacagatg 1140 

ggaaatgaaa atgtacagca actcctccaa gagggtatct cattaggtaa ttcagaggca 1200 

gacagacaat tgctggaagc tgcaaaggct ggagatgtcg aaactgtaaa aaaactgtgt 1260 

actgttcaga gtgtcaactg cagagacatt gaagggcgtc agtctacacc acttcatttt 1320 

gcagctgggt ataacagagt gtccgtggtg gaatatctgc tacagcatqg agctgatgtg 1380 

catgctaaag ataaaggagg ccttgtacct ttgcacaatg catgttctta cggacattat 1440 

gaagttgcag aacttcttgt taaacatgga gcagtagtta atgtagctga tttatggaaa 1500 

tttacacctt tacatgaagc agcagcaaaa ggaaaatatg aaatttgcaa acttctgctc 1560 

cagcatggtg cagaccctac aaaaaaaaac agggatggaa atactccttt ggatcttgtt 1620 

aaagatggag atacagatat tcaagatctg cttaggggag atgcagcttt gctagatgct 1680 

gccaagaagg gttgtttagc cagagtgaag aagttgtctt ctcctgataa tgtaaattgc 1740 

cgcgataccc aaggcagaca ttcaacacct ttacatttag cagctggtta taataattta 1800 

gaagttgcag agtatttgtt acaacacgga gctgatgtga atgcccaaga caaaggagga 1860 

cttattcctt tacataatgc agcatcttac gggcatgtag atgtagcaqc tctactaata 1920 
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aagtataatg catctctcaa tgccacggac aaatgggctt tcacacct tt gcacgaagca 1980 
gcccaaaagg gacgaacaca gctttgtgct ttgttgctag cccatggaqc tgacccgact 2040 
cttaaaaatc aggaaggaca aacaccttta gatttagttt cagcagatga tgtcagcgct 2100 
cttctgacag cagccatgcc cccatctgct ctgccctctt gttacaagr-c tcaagtgctc 2160 
aatggtgtga gaagcccagg agccactgca gatgctctct cttcaggtrc atctagccca 2220 
tcaagccttt ctgcagccag cagtcttgac aacttatctg ggagtttttc agaactgtct 2280 
tcagtagtta gttcaagtgg aacagagggt gcttccagtt tggagaaaaa ggaggttcca 2340 
ggagtagatt ttagcataac tcaattcgta aggaatcttg gacttgagca cctaatggat 2400 
atatttgaga gagaacagat cactttggat gtattagttg agatggggca caaggagctg 2460 
aaggagattg gaatcaatgc ttatggacat aggcacaaac taattaaagg agtcgagaga 2520 
cttatctccg gacaacaagg tcttaaccca tatttaactt tgaacacctc tggtagtgga 2580 
acaattctta tagatctgtc tcctgatgat aaagagtttc agtctgtgga ggaagagatg 2640 
caaagtacag ttcgagagca cagagatgga ggtcatgcag gtggaatctt caacagatac 2700 
aatattctca agattcagaa ggtttgtaac aagaaactat gggaaagata cactcaccgg 2760 
agaaaagaag tttctgaaga aaaccacaac catgccaatg aacgaatgct atttcatggg 2820 
tctccttttg tgaatgcaat tatccacaaa ggctttgatg aaaggcatac gtacataggt 2880 
ggtatgtttg gagctggcat ttattttgct gaaaactctt ccaaaagcaa tcaatatgta 2940 
tatggaattg gaggaggtac tgggtgtcca gttcacaaag acagatcttg ttacatttgc 3000 
. cacaggcagc tgctcttttg ccgggtaacc ttgggaaagt ctttcctgca gttcagtgca 3060 
atgaaaatgg cacattctcc tccaggtcat cactcagtca ctggtaqgcc cagtgtaaat 3120 
ggcctagcat tagctgaata tgttatttac agaggagaac aggcttat-^c tgagtattta 3180 
attacttacc agattatgag gcctgaaggt atggtcgatg gataaataqt tattttaaga 3240 
aactaattcc actgaaccta aaatcatcaa agcagcagtg gcctctargt tttactcctt 3300 
tgctgaaaaa aaatcatc'tt gcccncaggc ctgtggcaaa aggataaaaa tgtgaacgaa 3360 
gtttaacatt ctgacttgat aaagctttaa taatgtacag 3400 

SEQ ID NO: 2 
Length: 1074 
Type: PRT 

Organism: Homo sapiens 
Sequence: 2 

He Pro Leu His Asn Ala Cys Ser Phe Gly His Ala Glu Val Val Asn 
1 5 10 15 

Leu Leu Leu Arg His Gly Ala Asp Pro Asn Ala Arg Asp Asn Trp Asn 
20 25 30 

Tyr Thr Pro Leu His Glu Ala Ala He Lys Gly Lys He Asp Val Cys 
35 40 45 

He Val Leu Leu Gin His Gly Ala Glu Pro Thr He Arg Asn Thr Asp 
50 55 60 

Gly Arg Thr Ala Leu Asp Leu Ala Asp Pro Ser Ala Lys Ala Val Leu 
65 70 75 80 

Thr Gly Glu Tyr Lys Lys Asp Glu Leu Leu Glu Ser Ala Arg Ser Gly 
85 90 Q5 

Asn Glu Glu Lys Met Met Ala Leu Leu Thr Pro Leu Asn Va] Asn Cys 
100 105 1 10 

His Ala Ser Asp Gly Arg Lys Ser Thr Pro Leu His Leu Ala Ala Gly 
115 120 125 

Tyr Asn Arg Val Lys He Val Gin Leu Leu Leu Gin His Gly Arg Asp 
130 135 140 

Val His Ala Lys Asp Lys Gly Asp Leu Val Pro Leu His Asn Ala Cys 
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Ser Tyr Gly Hi s 



Cys Val Asn Ala 
180 

Ala Ser Lys Asn 
195 

Ala Asp Pro Thr 
210 

Ala Pro Thr Pro 
225 

His Ser Leu Leu 



Lys His Leu Ser 
260 

Glu Thr Ala Leu 
275 

Gin lie Cys Glu 
290 

Thr Lys Glu Phe 
305 

Asn Asp Val Val 



Leu Asp Asn Leu 
340 

His Leu Gin Thr 
355 

lie lie Ser Leu 
370 

Val Gin Gin Leu 
385 

Asp Arg Gin Leu 



Lys Lys Leu Cys 
420 

Arg Gin Ser Thr 
435 



Val Val Glu Tyr 
450 

Lys Gly Gly Leu 



150 



Tyr Glu Val Thr 
165 

Met Asp Leu Trp 



Arg Val Glu Val 
200 



Leu Leu Asn Cys 
215 

Gin Leu Lys Glu 
230 

Gin Ala AJ a Arg 
245 

Leu Glu Met Val 



His Cys Ala Ala 
280 



Leu Leu Leu Arg 
295 

Leu Thr Pro Leu 
310 

Glu Val Val Val 
325 

Gly Gin Thr Ser 



Cys Arg Leu Leu 
360 

Gin Gly Phe Thr 
375 

Leu Gin Glu Gly 

390 

Leu Glu Ala Ala 
405 

Thr Val Gin Ser 



Pro Leu His Phe 
440 

Leu Leu Gin His 
455 

Val Pro Leu His 



15 



] 55 

Glu Leu Leu Val 
170 

Gin Phe Thr Pro 
185 

Cys Ser Leu Leu 



Lys Asn Lys Ser 
220 

Arg Leu Ala Tyr 
235 

Glu Ala Asp Val 
250 

Asn Phe Lys His 

265 

Ala Ser Pro Tyr 



Lys Gly Ala Asn 
300 

His Val Ala Ser 
315 

Lys His Glu Ala 
330 

Leu His Arg Ala 
345 

Leu Ser Tyr Gly 



Ala Leu Gin Met 
380 

lie Ser Leu Gly 

395 

Lys Ala Gly Asp 
410 

Val Asn Cys Arg 
425 

Ala Ala Gly Tyr 



Gly Ala Asp Val 
460 

Asn Ala Cys Ser 



160 

Lys His G] y Gly 
175 

Leu His Glu Ala 
190 

Leu Ser Tyr Gly 
205 

Ala T 1 e Asp Leu 



Glu Phe Lys Gly 
240 

Thr Arg lie Lys 
255 

Pro Gin Thr His 
27 0 

Pro Lys Arg Lys 
285 

lie Asn Glu Lys 



Glu Lys Ala His 
320 

Lys Val Asn Ala 
335 

Ala Tyr Cys Gly 
350 



Cys Asp Pro Asn 
365 

Gly Asn Glu Asn 



Asn Ser Glu Ala 
400 

Val Glu Thr Val 
415 

Asp lie Glu Gly 
430 

Asn Arg Val Ser 
445 

His Ala Lys Asp 



Tyr Gly His Tyr 
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465 

Glu Val Ala Glu 



Asp Leu Trp Lys 
500 

Tyr Glu lie Cys 
515 



Lys Asn Arg Asp 
530 

Thr Asp lie Gin 
545 

Ala Lys Lys Gly 



Asn Val Asn Cys 
580 

Leu Ala Ala Gly 
595 

His Gly Ala Asp 
610 

His Asn Ala Ala 
625 

Lys Tyr Asn Ala 



Leu His Glu Ala 
660 

Leu Ala His Gly 
675 

Pro Leu Asp Leu 
690 

Ala Met Pro Pro 
705 

Asn Gly Val Arg 



Pro Ser Ser Pro 
740 

Ser Gly Ser Phe 
755 

Glu Gly Ala Ser 
770 

Ser lie Thr Gin 



470 



Leu Leu Val Lys 
485 

Phe Thr Pro Leu 



Lys Leu Leu Leu 
520 

Gly Asn Thr Pro 

535 

Asp Leu Leu Arg 
550 

Cys Leu Ala Arg 
565 

Arg Asp Thr Gin 



Tyr Asn Asn Leu 
600 

Val Asn Ala Gin 
615 

Ser Tyr Gly His 
630 

Ser Leu Asn Ala 
645 

Ala Gin Lys Gly 



Ala Asp Pro Thr 
680 

Val Ser Ala Asp 
695 

Ser Ala Leu Pro 
710 



Ser Pro Gly Ala 
725 

Ser Ser Leu Ser 



Ser Glu Leu Ser 
760 

Ser Leu Glu Lys 
775 

Phe Val Arg Asn 
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475 

His Gly Ala Val 
490 

His Glu Ala Ala 
505 

Gin His Gly Ala 



Leu Asp Leu Val 
540 



Gly Asp Ala Ala 
555 



Val Lys Lys Leu 
570 



Gly Arg His Ser 
585 

Glu Val Ala Glu 



Asp Lys Gly Gly 
620 

Val Asp Val Ala 
635 

Thr Asp Lys Trp 
650 



Arg Thr Gin Leu 
665 

Leu Lys Asn Gin 



Asp Val Ser Ala 
700 

Ser Cys Tyr Lys 
715 

Thr Ala Asp Ala 
730 

Ala Ala Ser Ser 
745 

Ser Val Val Ser 



Lys Glu Val Pro 
780 

Leu Gly Leu Glu 



480 

Val Asn Val Ala 
495 

Ala Lys Gly Lys 
510 



Asp Pro Thr Lys 
525 

Lys Asp Gly Asp 



Leu Leu Asp Ala 
560 

Ser Ser Pro Asp 
57 5 

Thr Pro Leu His 

530 

Tyr I.eu Leu Gin 
605 

Leu Tie Pro Leu 



Ala Leu Leu lie 
640 

Ala Phe Thr Pro 
655 

Cys Ala Leu Leu 
67 0 

Glu Gly Gin Thr 
685 

Leu Leu Thr Ala 



Pro Gin Val Leu 
720 

Leu Ser Ser Gly 
735 



Leu Asp Asn Leu 

750 



Ser Ser Gly Thr 
765 

Gly Val Asp Phe 



His Leu Met Asp 
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785 

lie Phe 



Glu Arg Glu Gin He Thr Leu Asp Val Leu Val Glu Met Gly 
805 810 815 



His Lys 
Lys Leu 



Glu Leu Lys Glu Tie Gly He Asn Ala Tyr Gly His Arg His 
820 825 R30 



He Lys Gly Val Glu Arg Leu He Ser Gly Gin Gin Gly Lei 
835 840 845 



Asn Pro 
850 



Tyr Leu Thr Leu Asn Thr Ser Gly Ser Gly Thr Tie Leu He 



Asp Leu 
865 



Ser Pro Asp Asp Lys Glu Phe Gin Ser Val Glu Glu Glu Met 
870 875 880 



Thr Val Arg Glu His Arg Asp Gly Gly His Ala Gly Gly He 
885 890 895 



Arg Tyr Asn He Leu Lys He Gin Lys Val Cys Asn Lys Lys 
900 9 05 Q 10 



Glu Arg Tyr Thr His Arg Arg Lys Glu Val Ser Glu Glu Asn 
915 920 925 



His Asn 
930 



His Ala Asn Glu Arg Met Leu Phe His Gly Ser Pro Phe Val 
935 940 



Asn Ala 
945 



He He His Lys Gly Phe Asp Glu Arg His Ala Tyr He Gly 
950 955 960 



Phe Gly Ala Gly He Tyr Phe Ala Glu Asn Ser Ser Lys Ser 
965 970 975 



. Tyr Val Tyr Gly He Gly Gly Gly Thr Gly Cys Pro Val Hi: 
980 985 990 



- Arg Ser Cys Tyr He Cys His Arg Gin Leu Leu Phe Cys Arg 
995 1000 1005 



Val Thr 
1010 



Leu Gly Lys Ser Phe Leu Gin Phe Ser Ala Met Lys Met Ala 
1015 1020 



His Ser 
1025 



Pro Pro Gly His His Ser Val Thr Gly Arg Pro Ser Val Asn 
1030 1035 1040 



Ala Leu Ala Glu Tyr Val He Tyr Arg Gly Glu Gin Ala Tyr 
1045 1050 1055 



Tyr Leu He Thr Tyr Gin lie Met Arg Pro Glu Gly Met Val 
1060 1065 1070 



Asp Gly 
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Claims: 

1. An isolated polynucleotide molecule encoding a candidate effector 
protein for the Grb7 family of signalling proteins, wherein the polynucleotide 

5 molecule comprises a nucleotide sequence having at least 75% sequence 
identity to that shown as SEQ ID NO: 1. 

2. A polynucleotide molecule according to claim 1, wherein the 
polynucleotide molecule comprises a nucleotide sequence having at least 

10 85% sequence identity to that shown as SEQ ID NO: 1. 

3. A polynucleotide molecule according to claim 1, wherein the 
polynucleotide molecule comprises a nucleotide sequence having at least 
95% sequence identity to that shown as SEQ ID NO: 1. 

15 

4. A polynucleotide molecule according to claim 1, wherein the 
polynucleotide molecule comprises a nucleotide sequence which 
substantially corresponds to that shown as SEQ ID NO: 1. 

20 5. A host cell transformed with a polynucleotide molecule according to 
any one of the preceding claims. 

6. A host cell according to claim 5, wherein the host cell is a mammalian, 
insect, yeast or bacterial host cell. 

25 

7. A method of producing a protein, comprising culturing the host cell of 
claim 5 or 6 under conditions suitable for the expression of the 
polynucleotide molecule and optionally recovering the protein. 

30 8. A purified protein, encoded by a polynucleotide molecule according to 
any one of claims 1 to 4. 

9. A purified protein according to claim 8. wherein the protein comprises 
an amino acid sequence substantially corresponding to that shown as SEQ ID 
35 NO: 2. 
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10. A fusion protein comprising an amino acid sequence substantially 
corresponding to that shown as SEQ ID NO: 2. 

11. An antibody or fragment thereof which specifically binds to a protein 
5 according to claim 8 or 9. 

12. An oligonucleotide probe comprising a nucleotide sequence of at least 
12 nucleotides, the oligonucleotide probe comprising a nucleotide sequence 
such that the oligonucleotide probe selectively hybridises to the 

10 polynucleotide molecule of any one of claims 1 to 4 xmder high stringency 
conditions. 

13. An oligonucleotide probe according to claim 12. wherein the 
oligonucleotide probe comprises a nucleotide sequence of at least 18 

15 nucleotides. 

14. A method of detecting in a sample the presence of an effector protein 
for the Grb7 family of proteins, the method comprising reacting the sample 
with an antibody or fragment thereof according to claim 11. 

20 

15. A method of detecting in a sample the presence of mRNA encoding an 
effector protein for the Grb7 family of proteins, the method comprising 
reacting the sample with an oligonucleotide probe of claim 12 or 13. 
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FIGURE 1 

ATTCCTCTTCATAATGCATGCTCTTTTGGTCATGCTGAAGTAGTCAATCTCCTTTTGCGACATGGTGCAG 70 
I PLHNACS FGHAEVVWLLLRHGA 



ACCCCAATGCTCGAGATAATTGGAATTATACTCCTCTCCATGAAGCTGCAATTAAAGGAAAGATTGATGT 14 0 
D P N A R D N WNYTPLHEAAIKGKI DV 

TTGCATTGTGCTGTTACAGCATGGAGCTGAGCCAACCATCCGAAATACAGATGGAAGGACAGCATTGGAT 210 
CIVLLQHGAEPTIRNT D G R T A L D 



TTAGCAGATCCATCTGCCAAAGCAGTGCTTACTGGTGAATATAAGAAAGATGAACTCTTAGAAAGTGCCA 280 
LADPSAKAVLTGEYKKDELLE5A 

GGAGTGGCAATGAAGAAAAAATGATGGCTCTACTCACACCATTAAATGTCAACTGCCACGCAAGTGATGG 350 
R S G NEEKMMALLTPLNVNCHAS D G 

CAGAAAGTCAACTCCATTACATTTGGCAGCAGGATATAACAGAGTAAAGATTGTACAGCTGTTACTGCAA 420 
RKSTP LHLAAGYNRVKIVQLLLQ 

CATGGACGTGATGTCCATGCTAAAGATAAAGGTGATCTGGTACCATTACACAATGCCTGTTCTTATGGTC 490 
HGRDVHAKDK GDLVPLHNACSYG 

ATTATGAAGTAACTGAACTTTTGGTCAAGCATGGTGGCTGTGTAAATGCAATGGACTTGTGGCAATTCAC 5 60 
HYEVTELLVKHGGCVNAMDL W Q F T 

TCCTCTTCATGAGGCAGCTTCTAAGAACAGGGTTGAAGTATGTTCTCTTCTCTTAAGTTATGGTGCAGAC 630 
PLHEAASKNRVEVCSLLLSYGAD 

CCAACACTGCTCAATTGTAAGAATAAAAGTGCTATAGACTTGGCTCCCACACCACAGTTAAAAGAAAGAT 700 
P T L L N C KNKSAI DLAPTPQLKER 



TAGCAXATGAATTTAAAGGCCACTCGTTGCTGCAAGCTGCACGAGAAGCTGATGTTACTCGAATCAAAAA 770 
LAYEFKGHSLLQAAREADVTRI KK 

ACATCTCTCTCTGGAAATGGTGAATTTCAAGCATCCTCAAACACATGAAACAGCATTGCATTGTGCTGCT 8 40 
HLSLEMVNFKHPQ THETALHCAA 

GCATCTCCATATCCCAAAAGAAAGCAAATATGTGAACTGTTGCTAAGAAAAGGAGCAAACATCAATGAAA 910 
A5PYPKRKQI C ELLLRKGANINE 

AGACTAAAGAATTCTTGACTCCTCTGCACGTGGCATCTGAGAAAGCTCATAATGATGTTGTTGAAGTAGT 98 0 
K T K EFLTPLHVAS EKAHNDVVEVV 

GGTGAAACATGAAGC AAAGGTTAATGCTCTGGATAATCTTGGTCAGACTTCTCTACACAGAGCTGCATAT 1050 
VKHEAKVNALDN LGQTSLHRAAY 

TGTGGTCATCTACAAACCTGCCGCCTACTCCTGAGCTATGGGTGTGATCCTAACATTATATCCCTTCAGG 1120 
CGHLQTCRLLL5 YGCDPNJ ISL Q 

GCTTTACTGCTTTACAGATGGGAAATGAAAATGTACAGCAACTCCTCCAAGAGGGTATCTCATTAGGTAA 1190 
GFTALQMGNEHVQQLLQEGI SLGN 

TTCAGAGGCAGACAGACAATTGCTGGAAGCTGC AAAGGCTGGAGATGTCGAAACTGTAAAAAAACT GTGT 1260 
SEADRQLL EAAKAGDVET V K K L C 



ACTGTTCAGAGTGTCAACTGCAGAGACATTGAAGGGCGTCAGTCTACACCACTTCATTTTGCAGCTGGGT 1330 
TVQSVNCRDJEG RQ5TPLHFAAG 

ATAACAGAGTGTCCGTGGTGGAATATCTGCTACAGCATGGAGCTGATGTGCATGCTAAAGATAAAGGAGG 14 00 
YNRVSVVEYLL QHGADVHAKDK G G 

CCTTGTAC CTTTGCACAATGCATGTTCTTAC GGAC ATT AT GAAGTTGC AGAACTT C T T GTT AAAC AT GGA 1470 
LVPLHNACSYGHYEVAELLVKHG 

GCAGTAGTTAATGTAGCTGATTTATGGAAATTTACACCTTT ACATGAAGCAGCAGCAAAAGGAAAATATG 15 4 0 
AVVNVADL WKFTPLHEAAAKGKY 



AAATTTGCAAACTTCTGCTCCAGCATGGTGCAGACCCTACAAAAAAAAACAGGGATGGAAATACTCCTTT 1610 
EICKLLLQHGADPTKKNR D G N T P L 
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GGATCTTGTTAAAGATGGAGATACAGATATTCAAGATCTGCTTAGGGGAGATGCAGCTTTGCTAGATGCT 168 0 
D LVKDGDTDI Q DLLRGDAA- LLDA 

GCCAAGAAGGGTTGTTTAGCCAGAGTGAAGAAGTTGTCTTCTCCTGATAATGTAAATTGCCGCGATACCC 175 0 
A K K G C LAR V K K L SSPDNVNCRDT 

AAGGC AGACATTCAACACCTTTACATTTAGCAGCTGGTTATAATAATTTAGAAGTTGCAGAGTATTTGTT 1820 
Q G RHSTPLHLAAGYNNLEVAEYLL 

AC AACACGGAGCTGATGTGAATGCCCAAGACAAAGGAGGACTTATTCCTTTACATAATGCAGCATCTTAC 18 9 0 
QHGADVNAQDK GGLIPLHNAASY 

GGGCATGTAGATGTAGCAGCTCTACTAATAAAGTATAATGCATCTCTCAATGCCACGGACAAATGGGCTT 1960 
GHVDVAALLI KYNASLNATDK W A 

TCACACCTTTGCACGAAGCAGCCCAAAAGGGACGAACACAGCTTTGTGCTTTGTTGCTAGCCCATGGAGC 2030 
FTPLHEAAQKGRTQLCALLLAHGA 

TGACCCGACTCTTAAAAATCAGGAAGGACAAACACCTTTAGATTTAGTTTCAGCAGATGATGTCAGCGCT 2100 
D P T L K N Q EGQTPLDLVSA D DVSA 

CTTCTGACAGCAGCCATGCCCCCATCTGCTCTGCCCTCTTGTTACAAGCCTCAAGTGCTCAATGGTGTGA 2170 
LLTAAMPPSALP5CYKP Q V L N G V 

GAAGCCCAGGAGCCACTGCAGATGCTCTCTCTTCAGGTCCATCTAGCCCATCAAGCCTTTCTGCAGCCAG 22 4 0 
RSPGATADALSSGPSSPSSLSAAS 

CAGTCTTGACAACTTATCTGGGAGTTTTTCAGAACTGTCTTCAGTAGTTAGTTCAAGTGGAAC AGAGGGT 2310 
SLDNLSGSFSELSSVVSSSGTEG 

GCTTCCAGTTTGGAGAAAAAGGAGGTTCCAGGAGTAGATTTTAGCATAACTCAATTCGTAAGGAATCTTG 2380 
ASSLEKKEVPGVDFSITQFVRNL 

GACTTGAGCACCTAATGGATATATTTGAGAGAGAACAGATCACTTTGGATGTATTAGTTGAGATGGGGCA 2450 
GLEHLMDI FEREQI TLDVLVEMGH 

CAAGGAGCTGAAGGAGATTGGAATCAATGCTTATGGACATAGGCACAAACTAATTAAAGGAGTCGAGAGA 2520 
KELKEIGINAYGHRHKLIKGVER 

CTTATCTCCGGACAACAAGGTCTTAACCCATATTTAACTTTGAACACCTCTGGTAGTGGAACAATTCTTA 2590 
LISGQQGLNPYLTLNTSGSGTIL 

TAGATCTGTCTCCTGATGATAAAGAGTTTCAGTCTGTGGAGGAAGAGATGCAAAGTACAGTTCGAGAGCA 26G0 
I DLSPDDKEFQSVEEEMQSTVREH 

CAGAGATGGAGGTCATGCAGGTGGAATCTTCAACAGATACAATATTCTCAAGATTCAGAAGGTTTGTAAC 2730 
RDGGHAGGI FNRYNILKIOKVCN 

AAGAAAC TAT GGGAAAGAT AC AC TC AC C GGAGAAAAGAAGTTTC T GAAGAAAAC C AC AAC C AT GC C AAT G 2800 
KKLWERYTHRRKEVSEENHNHAN 

AACGAATGCTATTTCATGGGTCTCCTTTTGTGAATGCAATTATCCACAAAGGCTTTGATGAAAGGCATGC 2870 
ERMLFHGS PFVNAI IHKGFDERHA 

GTACATAGGTGGTATGTTTGGAGCTGGCATTTATTTTGCTGAAAACTCTTCCAAAAGCAATCAATATGTA 2940 
YI GGMFGAGI YFAENSSKSNQYV 

TATGGAATTGGAGGAGGTACTGGGTGTCCAGTTCACAAAGACAGATCTTGTTACATTTGCCACAGGCAGC 3010 
YGIGGGTGCPVHKDRSCYI CHRQ 

TGCTCTTTTGCCGGGTAACCTTGGGAAAGTCTTTCCTGCAGTTCAGTGCAATGAAAATGGCACATTCTCC 308 0 
LLFCRVTLGKS FLQFSAMKMAHSP 

TCCAGGTCATCACTCAGTCACTGGTAGGCCCAGTGTAAATGGCCTAGCATTAGCTGAATATGTTATTTAC 315 0 
PGHHSVTGRPSVNGLALAEYVIY 

AGAGGAGAACAGGCTTATCCTGAGTATTTAATTACTTACCAGATTATGAGGCCTGAAGGTATGGTCGATG 322 0 
RGEQAYPEYL I TYQIMRP EGMVD 



GATAAATAGTTATTTTAAGAAACTAATTCCACTGAACCTAAAATCATCAAAGCAGCAGTGGCCTCTACGT 
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TTTACTCCTTTGCTGAAAAAAAATCATCTTGCCCACAGGCCTGTGGCAAAAGGATAAAAATGTGAACGAA 3360 



GTTTAACATTCTGACTTGATAAAGCTTTAATAATGTACAG 
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